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The MAILING DATE of this communication appears on the cover sheet with the correspondence address 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). in no event, however, may a reply be timely fried 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 
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earned patent term adjustment. See 37 CFR 1 .704(b). 
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2a)D This action is FINAL. 2b)^ This action is non-final. 
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Claim Rejections - 35 USC § 103 



The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

Claims 1-20 rejected under 35 U.S.C. 103(a) as being unpatentable over Woodfill, and 
further in view of Moezzi. 
1. Claim 1. 

"A method of aligning a first image to a second image, comprising: determining a first alignment 
approximation, based on distances between one or more points in the first image and the second 
image, aligning the second image to the first image, based on the first alignment approximation, 
to form an initially aligned second image, determining a second alignment approximation, based 
on distances between one or more points in the first image and the initially aligned second 
image, and aligning the second image to the first image, based on a combination of the first and 
second alignment approximation", Woodfill teaches in (col. 12, lines 57-67) the single camera 
may or may not be in motion . Thus, distinct images can be identified by their space/time 
attributes. Using a single camera, the "left 11 image may correspond to an image captured at one 
time, and the "right" image may correspond to an image captured at another time. The analysis 
then involves comparing successive frames; that is, if a, b, c, and d represent successive frames 
of images captured by the single camera, a and b are compared, then b and c, then c and d, and so 
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on. Similarly, the single camera may shift or move between two distinct positions (i.e., left 
position and right position) back and forth and the captured images are appropriately designated 
or assigned to either the left or right image. Woodfill does not explicitly specify the alignment 
based on distance between one or more points in the first image and the second image, however 
Moezzi teaches in (col. 28, lines 54-56) all supporting observations are used with appropriate 
weighting based on distance from the camera (distance between one or more points in the 
images) , direction of motion, and to update the position of each object. The step of second 
alignment based on distances is obvious because the process will continue for each frame. 
Woodfill teaches in (col. 32, lines 20-22) the computations will involve a combination of 
additions and subtractions of previously calculated arrays and image data. 
Thus, it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to incorporate the teaching of Moezzi into Woodfill in order to improve the lack of 
computation power. 
2. Claim 2. 

"The method of claim 1, wherein aligning the second image to the first image based on the 
combination of the first and second alignment approximations is effected by: aligning the 
initially aligned second image, which is based on the first alignment approximation, to the first 
image, based on the second alignment approximation", Woodfill teaches in (col. 12, lines 57-67) 
the single camera may or may not be in motion. Thus, distinct images can be identified by their 
space/time attributes. Using a single camera, the "left" image may correspond to an image 
captured at one time, and the "right" image may correspond to an image captured at another time. 
The analysis then involves comparing successive frames; that is, if a, b, c, and d represent 
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successive frames of images captured by the single camera, a and b are compared, then b and c, 
then c and d, and so on. Similarly, the single camera may shift or move between two distinct 
positions (i.e., left position and right position) back and forth and the captured images are 
appropriately designated or assigned to either the left or right image. 

3. Claim 3. 

"The method of claim 1, wherein, determining the first alignment approximation is based on a 
low-resolution representation of the first and second images, and determining the second 
alignment approximation is based on a higher-resolution representation of the first and second 
images", Woodfill does not explicitly specify the higher-resolution representation of images, 
however Moezzi teaches in (col. 33, lines 21-26) the model shown in FIGS. 6, 7 and 8 employs 
an (x, y, z) world coordinate, bounding box object representation. That is, the system track 
object centroid and uses a bounding box to indicate presence of an object at a particular location. 
A voxel-based representation supports finer resolution of object shape and location. 
Thus, it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to incorporate the teaching of Moezzi into Woodfill in order to improve the lack of 
computation power. 

4. Claim 4. 

"The method of claim 1, wherein determining at least one of the first alignment and second 
alignment approximations includes applying the RANSAC algorithm", the step is obvious 
because the RANSAC algorithms is well known in the art, however applicant should illustrates 
the calculations, variables and interpretation of data more in detail. 

5. Claim 5. 
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"The method of claim 1, wherein determining the first alignment approximation includes an 
approximation of at least one of a rotation component and a translation component in an image 
space of the first and second images", Woodfill does not explicitly specify the higher-resolution 
representation of images, however Moezzi teaches in (col. 33, lines 21-26) the model shown in 
FIGS. 6, 7 and 8 employs an (x, y, z) world coordinate, bounding box object representation. 
That is, the system track object centroid and uses a bounding box to indicate presence of an 
object at a particular location. A voxel-based representation supports finer resolution of object 
shape and location. Thus, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to incorporate the teaching of Moezzi into Woodfill in order to 
improve the lack of computation power. 

6. Claim 6. 

"The method of claim 5, wherein determining the second alignment approximation includes an 
approximation of components of a 3x3 homographic matrix", Woodfill does not explicitly 
specify the 3x3 homographic matrix, however Moezzi teaches in (col. 35, lines 18-25) where M v 
is the 4x4 homogeneous transformation matrix representing transformation between V and the 
world W. Thus, it would have been obvious to one of ordinary skill in the art at the time the 
invention was made to incorporate the teaching of Moezzi into Woodfill in order to improve the 
lack of computation power. 

7. Claim 7. 

"The method of claim 1, wherein determining the second alignment approximation includes an 
approximation of components of a 3x3 homographic matrix", Woodfill does not explicitly 
specify the 3x3 homographic matrix, however Moezzi teaches in (col. 35, lines 18-25) where M v 
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is the 4x4 homogeneous transformation matrix representing transformation between V and the 
world W. Thus, it would have been obvious to one of ordinary skill in the art at the time the 
invention was made to incorporate the teaching of Moezzi into Woodfill in order to improve the 
lack of computation power. 

8. Claim 8. 

"The method of claim 1, wherein determining at least one of the first and second alignment 
approximations includes identifying corners in the first and second images based on a 
determination of Minimum Intensity Changes at the corners", the step is obvious because the 
comparison of different frame of images will provide alignment approximation to identify 
corners. 

9. Claim 9. 

"A method of tracking an object based on a first image and a second image, comprising: aligning 
the first and second images to form a set of aligned images, and detecting motion by comparing 
the set of aligned images, wherein aligning the first and second images includes: determining a 
first alignment approximation, based on distances between one or more points in the first image 
and the second image, aligning the second image to the first image, based on the first alignment 
approximation, to form an initially aligned second image, determining a second alignment 
approximation, based on distances between one or more points in the first image and the initially 
aligned second image, and aligning the second image to the first image, based on a combination 
of the first and second alignment approximations", Woodfill teaches in (col. 12, lines 57-67) the 
single camera may or may not be in motion . Thus, distinct images can be identified by their 
space/time attributes. Using a single camera, the "left" image may correspond to an image 
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captured at one time, and the "right" image may correspond to an image captured at another time. 
The analysis then involves comparing successive frames; that is, if a, b, c, and d represent 
successive frames of images captured by the single camera, a and b are compared, then b and c, 
then c and d, and so on. Similarly, the single camera may shift or move between two distinct 
positions (i.e., left position and right position) back and forth and the captured images are 
appropriately designated or assigned to either the left or right image. Woodfill does not explicitly 
specify the alignment based on distance between one or more points in the first image and the 
second image, however Moezzi teaches in (col. 28, lines 54-56) all supporting observations are 
used (with appropriate weighting based on distance from the camera, direction of motion, etc.) to 
update the position of each object. The step of second alignment based on distances is obvious 
because the process will continue for each frame. Woodfill teaches in (col. 32, lines 20-22) the 
computations will involve a combination of additions and subtractions of previously calculated 
arrays and image data. 

Thus, it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to incorporate the teaching of Moezzi into Woodfill in order to improve the lack of 
computation power. 
10. Claim 10. 

"The method of claim 9, wherein determining the first alignment approximation is based on a 
low-resolution representation of the first and second images, and determining the second 
alignment approximation is based on a higher-resolution representation of the first and second 
images", Woodfill does not explicitly specify the higher-resolution representation of images, 
however Moezzi teaches in (col. 33, lines 21-26) the model shown in FIGS. 6, 7 and 8 employs 
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an (x, y, z) world coordinate, bounding box object representation. That is, the system track 
object centroid and uses a bounding box to indicate presence of an object at a particular location. 
A voxel-based representation supports finer resolution of object shape and location. 
Thus, it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to incorporate the teaching of Moezzi into Woodfill in order to improve the lack of 
computation power. 

11. Claim 11. 

"The method of claim 9, further including identifying the object in the set of aligned images 
based on color matching", the step is obvious because the aligned images should be based on 
color and intensity of pixels. Woodfill teaches in (col. 12, lines 57-67) the single camera may or 
may not be in motion. Thus, distinct images can be identified by their space/time attributes. 
Using a single camera, the "left" image may correspond to an image captured at one time, and 
the "right" image may correspond to an image captured at another time. The analysis then 
involves comparing successive frames; that is, if a, b, c, and d represent successive frames of 
images captured by the single camera, a and b are compared, then b and c, then c and d, and so 
on. Similarly, the single camera may shift or move between two distinct positions (i.e., left 
position and right position) back and forth and the captured images are appropriately designated 
or assigned to either the left or right image. 

12. Claim 12. 

"The method of claim 9, further including determining a location of the object in each image of 
the set of aligned images, and determining a movement of the object by comparing the location 
of the object in each image", Woodfill does not explicitly specify the higher-resolution 
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representation of images, however Moezzi teaches in (col. 33, lines 21-26) the model shown in 
FIGS. 6, 7 and 8 employs an (x, y, z) world coordinate, bounding box object representation. 
That is, the system track object centroid and uses a bounding box to indicate presence of an 
object at a particular location. A voxel-based representation supports finer resolution of object 
shape and location. Thus, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to incorporate the teaching of Moezzi into Woodfill in order to 
improve the lack of computation power. 
13. Claim 13. 

"A motion detecting system comprising: a processor that is configured to: align a first image and 
a second image, to form a set of aligned images, by: determining a first alignment 
approximation, based on distances between one or more points in the first image and the second 
image, aligning the second image to the first image, based on the first alignment approximation, 
to form an initially aligned second image, determining a second alignment approximation, based 
on distances between one or more points in the first image and the initially aligned second 
image, and aligning the second image to the first image, based on a combination of the first and 
second alignment approximations; and compare the set of aligned images to identify motion of 
objects within the first and second images", Woodfill teaches in (col. 12, lines 57-67) the single 
camera may or may not be in motion . Thus, distinct images can be identified by their 
space/time attributes. Using a single camera, the "left" image may correspond to an image 
captured at one time, and the "right" image may correspond to an image captured at another time. 
The analysis then involves comparing successive frames; that is, if a, b, c, and d represent 
successive frames of images captured by the single camera, a and b are compared, then b and c, 
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then c and d, and so on. Similarly, the single camera may shift or move between two distinct 
positions (i.e., left position and right position) back and forth and the captured images are 
appropriately designated or assigned to either the left or right image. Woodfill does not explicitly 
specify the alignment based on distance between one or more points in the first image and the 
second image, however Moezzi teaches in (col. 28, lines 54-56) all supporting observations are 
used (with appropriate weighting based on distance from the camera, direction of motion, etc.) to 
update the position of each object. The step of second alignment based on distances is obvious 
because the process will continue for each frame. Woodfill teaches in (col. 32, lines 20-22) the 
computations will involve a combination of additions and subtractions of previously calculated 
arrays and image data. 

Thus, it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to incorporate the teaching of Moezzi into Woodfill in order to improve the lack of 
computation power. 
14. Claim 14. 

"The motion detecting system of claim 13, wherein the processor is configured to: determine the 
first alignment approximation by processing a low-resolution representation of at least one of the 
first and second images, and determine the second alignment approximation by processing a 
higher-resolution representation of the first and second images", Woodfill does not explicitly 
specify the higher-resolution representation of images, however Moezzi teaches in (col. 33, lines 
21-26) the model shown in FIGS. 6, 7 and 8 employs an (x, y, z) world coordinate, bounding box 
object representation. That is, the system track object centroid and uses a bounding box to 
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indicate presence of an object at a particular location. A voxel-based representation supports 
finer resolution of object shape and location. 

Thus, it would have been obvious to one of ordinary skill in the art at the time the invention was 
made to incorporate the teaching of Moezzi into Woodfill in order to improve the lack of 
computation power. 

15. Claim 15. 

"The motion detecting system of claim 13, further including one or more cameras for producing 
the first and second images", Woodfill illustrates in Fig. 1 . 

16. Claim 16. 

"The motion detecting system of claim 13, further including a memory for storing a 
representation of a target image, and wherein the processor is further configured to identify a 
target within the set of aligned images, based on the representation of the target image", 
Woodfill illustrates in Fig. 65. And also Woodfill teaches in (col. 8, lines 62-67) the hardware 
aspect of the present invention represents a parallel pipelined computing system designed to 
perform data set comparisons efficiently and at low cost. Data is processed in a systolic nature 
through the pipeline. This image processing system provides high performance via high 
computational density, high memory bandwidth, and high I/O bandwidth. 

17. Claim 17. 

"The motion detecting system of claim 16, wherein the representation of the target image is a 
characterization based on color content of the target image", the step is obvious because the 
aligned images should be based on color and intensity of pixels. Woodfill teaches in (col. 12, 
lines 57-67) the single camera may or may not be in motion. Thus, distinct images can be 
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identified by their space/time attributes. Using a single camera, the "left" image may correspond 
to an image captured at one time, and the "right" image may correspond to an image captured at 
another time. The analysis then involves comparing successive frames; that is, if a, b, c, and d 
represent successive frames of images captured by the single camera, a and b are compared, then 
b and c, then c and d, and so on. Similarly, the single camera may shift or move between two 
distinct positions (i.e., left position and right position) back and forth and the captured images are 
appropriately designated or assigned to either the left or right image. 

18. Claim 18. 

"The motion detecting system of claim 13, further including determining a location of an object 
in each image of the set of aligned images, and determining a movement of the object by 
comparing the location of the object in each image", Woodfill teaches in (col. 12, lines 57-67) 
the single camera may or may not be in motion. Thus, distinct images can be identified by their 
space/time attributes. Using a single camera, the "left" image may correspond to an image 
captured at one time, and the "right" image may correspond to an image captured at another time. 
The analysis then involves comparing successive frames; that is, if a, b, c, and d represent 
successive frames of images captured by the single camera, a and b are compared, then b and c, 
then c and d, and so on. Similarly, the single camera may shift or move between two distinct 
positions (i.e., left position and right position) back and forth and the captured images are 
appropriately designated or assigned to either the left or right image. 

19. Claim 19. 

"The motion detecting system of claim 13, wherein determining the first alignment 
approximation includes an approximation of at least one of a rotation component and a 
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translation component", Woodfill teaches in (col. 12, lines 57-67) the single camera may or may 
not be in motion. Thus, distinct images can be identified by their space/time attributes. Using a 
single camera, the "left" image may correspond to an image captured at one time, and the "right" 
image may correspond to an image captured at another time. The analysis then involves 
comparing successive frames; that is, if a, b, c, and d represent successive frames of images 
captured by the single camera, a and b are compared, then b and c, then c and d, and so on. 
Similarly, the single camera may shift or move between two distinct positions (i.e., left position 
and right position) back and forth and the captured images are appropriately designated or 
assigned to either the left or right image. 
20. Claim 20. 

"The motion detecting system of claim 19, wherein determining the second alignment 
approximation includes an approximation of components of a 3x3 homographic matrix", 
Woodfill does not explicitly specify the 3x3 homographic matrix, however Moezzi teaches in 
(col. 35, lines 18-25) where M v is the 4x4 homogeneous transformation matrix representing 
transformation between V and the world W. Thus, it would have been obvious to one of ordinary 
skill in the art at the time the invention was made to incorporate the teaching of Moezzi into 
Woodfill in order to improve the lack of computation power. 




Application/Control Number: 09/854,1 19 
Art Unit: 2672 



Page 14 



Conclusion 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Javid A Amini whose telephone number is 703-605-4248. The 
examiner can normally be reached on 8-4pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Michael Razavi can be reached on 703-305-4713. The fax phone numbers for the 
organization where this application or proceeding is assigned are 703-746-8705 for regular 
communications and 703-746-8705 for After Final communications. 

Any inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the receptionist whose telephone number is 703-306-0377. 



Javid A Amini 
Examiner 
Art Unit 2672 



Javid Amini 



April 17, 2003 
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